Synergies in learning words and their referents
نویسندگان
چکیده
This paper presents Bayesian non-parametric models that simultaneously learn to segment words from phoneme strings and learn the referents of some of those words, and shows that there is a synergistic interaction in the acquisition of these two kinds of linguistic information. The models themselves are novel kinds of Adaptor Grammars that are an extension of an embedding of topic models into PCFGs. These models simultaneously segment phoneme sequences into words and learn the relationship between non-linguistic objects to the words that refer to them. We show (i) that modelling inter-word dependencies not only improves the accuracy of the word segmentation but also of word-object relationships, and (ii) that a model that simultaneously learns word-object relationships and word segmentation segments more accurately than one that just learns word segmentation on its own. We argue that these results support an interactive view of language acquisition that can take advantage of synergies such as these.
منابع مشابه
A Bayesian Framework for Learning Words From Multiword Utterances
Current computational models of word learning make use of correspondences between words and observed referents, but as of yet cannot—as human learners do—leverage information regarding the meaning of other words in the lexicon. Here we develop a Bayesian framework for word learning that learns a lexicon from multiword utterances. In a set of three simulations we demonstrate this framework’s fun...
متن کاملThe ontogeny of lexical networks: toddlers encode the relationships among referents when learning novel words.
Although the semantic relationships among words have long been acknowledged as a crucial component of adult lexical knowledge, the ontogeny of lexical networks remains largely unstudied. To determine whether learners encode relationships among novel words, we trained 2-year-olds on four novel words that referred to four novel objects, which were grouped into two visually similar pairs. Particip...
متن کاملThe Ontogeny of Lexical Networks: Toddlers Encode the Relationships Among Referents When Learning
Although the semantic relationships among words have long been acknowledged as a crucial component of adult lexical knowledge, the ontogeny of lexical networks remains largely unstudied. To determine whether learners encode relationships among novel words, we trained 2-year-olds on four novel words that referred to four novel objects, which were grouped into two visually similar pairs. Particip...
متن کاملInfants' learning of novel words in a stochastic environment.
In everyday word learning words are only sometimes heard in the presence of their referent, making the acquisition of novel words a particularly challenging task. The current study investigated whether children (18-month-olds who are novice word learners) can track the statistics of co-occurrence between words and objects to learn novel mappings in a stochastic environment. Infants were briefly...
متن کاملPreschoolers’ flexible use of talker information during word learning
Previous research suggests that preschool-aged children use novel information about talkers’ preferences (e.g. favorite colors) to guide on-line language processing. But can children encode information about talkers while simultaneously learning new words, and if so, how is talker information encoded? In five experiments, children learned pairs of early-overlapping words (geeb, geege); a partic...
متن کامل